Isolation Levels for Data Sharing in Large-Scale Scientific Workflows
نویسندگان
چکیده
Scientists can benefit from Grid and Cloud infrastructures to face the increasing need to share scientific data and execute data-intensive workflows at a large scale. However, these workflows are creating more and more challenging problems in the automation of data management during execution. Existing workflow management systems focus on how data is stored, transfered and on data provenance. However they lack in managing isolation during the execution of tasks of the same or different workflows that read/update shared data. In this scope, we propose three isolation levels taking into account data provenance and multiversioning. In the best of our knowledge this is the first proposal in such context.
منابع مشابه
Staging-Based Data Management for Extreme Scale Coupled Scientific Workflows
Advanced scientific workflows running at extreme scale on high end computing platforms are providing new capabilities and new opportunities for insights in a wide range of application domain. These workflows compose multiple simulation, data analysis and other application components that require data sharing and exchange at runtime. However, due to the increasing data volumes and associated I/O...
متن کاملEffective and efficient similarity search in scientific workflow repositories
Scientific workflows have become a valuable tool for large-scale data processing and analysis. This has led to the creation of specialized online repositories to facilitate workflow sharing and reuse. Over time, these repositories have grown to sizes that call for advanced methods to support workflow discovery, in particular for similarity search. Effective similarity search requires both high ...
متن کاملScience gateway technologies for the astrophysics community
The availability of large-scale digital surveys offers tremendous opportunities for advancing scientific knowledge in the astrophysics community. Nevertheless the analysis of these data often requires very powerful computational resources. Science Gateway technologies offer web-based environments to run applications with little concern for learning and managing the underlying infrastructures th...
متن کاملInformation flow analysis of scientific workflows
Recently, scientific workflows have emerged as a platform for automating and accelerating data processing and data sharing in scientific communities. Many scientific workflows have been developed for collaborative research projects that involve a number of geographically distributed organizations. Sharing of data and computation across organizations in different administrative domains is essent...
متن کاملDR-SWDF: A Dynamically Reconfigurable Framework for Scientific Workflows Deployment in the Cloud
Workflows management systems (WfMS) are aimed for designing, scheduling, executing, reusing, and sharing workflows in distributed environments like the Cloud computing. With the emergence of e-science workflows, which are used in different domains like astronomy, life science, and physics, to model and execute vast series of dependents functionalities and a large amount of manipulated data, the...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2017